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ABSTRACT 

A Topic Map is a structured network of hyperlinks that points into an information pool. Topic Maps have an 
existence independent of the information pool and hence different Topic Maps can form different layers above the 
same information pool and provide us with different views of it. We explore the use of Topic Maps with the Unified 
Column Descriptor (UCD) scheme developed in the frame of the ESO-CDS data mining project. UCD, with its 
multi-tier hierarchical structure, categorizes parameters reported in tables and catalogs. By using Topic Maps we 
show how columns from different catalogs with similar but not identical descriptions could be combined. A direct 
application for the Virtual Observatory community is that of merging catalogs in order to generate customized views 
of data. 

Keywords: Virtual Observatory, Topic Map, XML, Large Datasets, Knowledge Organization, Data Discovery, 
Semantic Nets 

1. INTRODUCTION 

Over the last couple of years several VO groups have been making steady progress in defining and prototyping various 
access and interconnectivity tools. It is clear that not only will the VO be internet-based, but also that it will be 
very XML-heavy. The most important feature of the VO will be to allow access to desired information in a quick 
and systematic way. All the metadata - handle to the real data - will populate different XML files. XML is highly 
structured and very readable. However, it does not offer itself to searches trivially. A Topic Map (TM) - an XML 
file itself - can form a layer of hyperlinks over an information resource and serve as the GPS in a huge database. 
With TMs one can define complex knowledge structures and attribute them as metadata to information resources 
allowing one to systematically organize knowledge on a variety of data subjects such that the retrieval and sharing 
with other users is easy. 

As an analogy from everyday life, a Topic Map is like the index at the back of a book. Like an index, a Topic 
Map presents an array of subjects, along with links to the occurrences of information, which for a book are page 
numbers. A book index can have entries in italic (indicating, for example, the presence of an image on page so and 
so), or in bold (the main explanation), and links to other parts of the index (for 'Active Galaxies', see 'Galaxies, 
Active'). But Topic Maps also provide different kinds of indexes for different purposes, for different users. A TM 
links all the topics in a given area, letting users find the information they want more transparently by navigating 
through the concepts of the subject area, rather than having to know the underlying way in which the information 
has been organized and stored. 

Using Topic Maps to interlink datasets goes one step ahead than existing tools and services for two reasons: (1) 
users will be able to customize Topic Maps to the level that they desire, and (2) new VO services that are developed 
elsewhere and by other teams will not only be instantly linkable through Topic Maps, but also the Topic Map indexing 
and querying mechanisms will allow a seamlessly merged appearance of these services allowing for more semantically 
coherent knowledge discovery. 

In this paper we provide an outline of what Topic Maps are and where the Topic Map technology is heading. We 
then describe the Uniform Column Descriptors (UCDs) and go on to discuss the Topic Map made from the UCDs. 
We then describe the Topic Map engine we have implemented to allow users to create their own UCD Topic Maps 
and merge those with the ones we have constructed. Finally we present sketches for more Topic Maps which can be 
used as VO tools. 
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2. TOPIC MAPS 



Topic Maps started life as a standard for software documentation in 1991 (HyTime Hypertext 1991 conference in 
San Antonio). However, that was much before the time was ripe for the idea. It was only when XML matured that 
their potential in information and knowledge organization has been realized. In 1996 Topic Maps became a work 
item in ISO's SGML working group resulting in a Topic Map standard published in 1999. This has since then led to 
different working groups on XTM, TMQL and so on and the progress has been rapid. More history can be found in 
Pepper(1999). 

A Topic Map is a collection of topics linked together by associations between the topics. The topics can occur 
in different contexts (scopes) and the associations qualify the occurrences. In the map, each node is a named topic. 
Associations between nodes are expressed by links. A Topic Map thus is a structured network of hyperlinks above an 
information pool from which the topics are drawn. The named topics can be just about anything thereby allowing 
a Topic Map to discuss abstract relationships between different topics. Depending on the situation, different terms 
could be chosen to be regarded as topics. In the following sections we will describe in detail the UCD Topic Map. 
For a more detailed, general introduction to Topic Maps, please see Mahabal et al.(2001). 

The real power of Topic Maps comes from the scoping ability. Scopes can be used to categorize different occur- 
rences of a given topic depending on context. A good way to look at Topic Maps with their scoping ability is the 
following analogy: When searching for something on the internet search engines, one gets a lot of muck in which the 
real gems are hidden. If proper keywords are used, and if the search engine does not index everything, the gems float 
to the top. This is exactly what happens when the scoping mechanism is used for discrimination and for qualifying 
context. It limits the validity of a topic by categorizing it into one or more themes. Thus scoping allows one to zoom 
in not only on different topics but also very specific subcategories e.g. a quasar could be scoped to be one with a 
binary companion, and also one with a redshift exceeding 5. It becomes very useful when, for example, searching for 
specialized entities like high redshift quasar clusters. 

In general, a Topic Map can be looked at in many different ways: 

• As a semantic network over an information pool. Though a Topic Map connects (associates) different con- 
cepts/subjects (topics) within an information pool, it has an existence distinct from the information pool itself. 
This network of hyperlinks can be used in its own right as a resource. 

• A configurable data interconnections viewer. Besides being a network of hyperlinks, the Topic Map is also 
capable of rendering itself as a configurable viewer of the data resource it is talking about. This comes about 
because of the possibility of scoping which allows one to prune the Topic Map for viewing purposes in well 
defined complex ways. 

• Basis for structured querying of XML files. This aspect of Topic Maps can not be overstated. Topic Maps are 
wonderful for structuring and displaying information. In Sec. 6 we describe Topic Map querying mechanisms. 

• A data/knowledge discovery tool. This is the aspect we want to stress here. With the Virtual Observatory 
wanting to deal with extremely large datasets, we need tools that will help us make sense of the multidimen- 
sionality (table- widths) as well as the multi-trillion sizes of individual datasets (table- lengths). And then we 
would want to combine the datasets meaningfully. What better way than to do that while maintaining the 
semantic structure? Topic Maps allow for the diverse data to be meaningfully combined. 

The mathematical model that can be used for Topic Maps is that of a hypergraph. Topics form the vertices, asso- 
ciations are the edges and occurrences form the incidences. A hypergraph representation consisting of neighborhood 
of a vertex can be found at KartocQ. An alternate representation of a hypergraph, a bipatriate graph, can be seen 
as part of Mondeca's topic map softwartQ. More details can be found in Auillans et al.(2002) and Auillans(2002). 

'http://www.kartoo.com 
' http://www.mondeca.com 



3. UNIFIED COLUMN DESCRIPTORS 



VO is about data: access to data, pipelines for data, processing of data, interoperability and so on. The data itself is 
the main component and is in the form of images, spectra, catalogs etc. An important component of all these is the 
metadata. UCDs are a first step in bringing about uniformity in the catalog metadata - the different column names 
that authors use. ESO and CDS have worked together to form a 4-tier hierarchical set of standardized column names 
- called Unified Column Descriptors - to categorize all the different column names that are used in astronomical 
catalogs. There are 35 main categories including Photometry, Positions, Physical Quantities etc. We have made 
use of the 1400+ UCDs as our Published Subject Indicator (PSI) list in generating a template for the Topic Map 
we present here. Should the PSI evolve to a different set of names, regenerating the Topic Map to reflect that is 
straightforward. More details about UCDs can be accessed at the Vizier websitqj, Ortiz et al. (1999) and elsewhere 
in these proceedings (Genova et al., 2002). Some examples of UCDs are: 

• PHOT_FLUX_IR_12: This UCD is used for columns which represent "Flux density (IRAS) at 12 microns, or 
around 12 microns (ISO at 14.3)". This pulls together different names that different astronomers have used for 
this quantity e.g. C, F12, F12umEst, FLUX12, Fnu_12 etc. (see Fig. 3). 

• POS_EQ_RA_MAIN: This UCD is used to describe the Right Ascension and combines various names used over 
years and epochs e.g. RA(ICRS), RA1855, RAB1900, RAB1950, RAhms, RAJ2000 etc. 

4. UCD TOPIC MAP 

A UCD Topic Map is constructed from the metadata (column names and units) for a set of tables (catalogs). It does 
not concern itself with the actual data in the catalog. Their sheer number can be overwhelming and that problem 
can anyway be tackled outside separately. Instead, the UCD Topic Map revolves around Columns, Column Names 
and Tables. Thus, the main topics include: 

• all 1400+ UCD names, 

• the corresponding UCD descriptions, 

• the column Names from the Tables used in the Topic Map, and 

• the Tables themselves (any set of tables can be used to make a Topic Map). 

In addition, there are several other topics owing to the hierarchical nature of the UCDs (e.g. the depth of a UCD 
in the list), units associated with different columns and any number of external links per Table (e.g. a hyperlink 
to the actual table somewhere on the internet) or even per Column (e.g. a histogram for the column, or quantilcs 
obtained through external CGI programs). It is these external links that provide a great deal of flexibility to the 
Topic Map. 

In the following few paragraphs we will be switching between descriptive and visual views of different aspects of 
Topic Maps. We show in Fig. 1 the top view of the Topic Map as seen in Ontopia's freely available OmnigatoJ§ 
The view shows the different subject indexes (explicitly defined topics), relationship indexes (defined associations), 
role indexes (the roles played by the topics in the associations) and resource indexes (external resources e.g. links 
to tables). As indicated before, the Topic Map is a network of links allowing one to jump from a general topic (e.g. 
UCD) to a particular instance of it (e.g. the UCD PHOT_FLUX_IR_12) to the Table in which this UCD occurs (e.g. 
the catalog of Seyfert Galaxies). The display with UCD as the central topic is captured in Fig. 2 while Fig. 3 shows 
a view with PHOT_FLUX_IR_12 being the central topic and Fig. 4 shows an HST related table to be the central 
topic. Each of these views is rich in semantic information. 

For the UCD Topic Map we have defined very basic associationships between the different topics. Some examples 

are: 

• UCD occurs in Table, 



http://vizier.u-strasbg.fr/doc/UCD.htx 
http: / /www. ontopia.net 



UCD Topic Map 



Subject Indexes (5) 



• table 

• ucd 

• ucddesc 

• unit 



Relationship Indexes (8) 



* ColDescOccursIn 

* hasColumnAssoc 

* hasDescription 

* hasUnits 

* isParentUCD 

* UCDDescOccursIn 



• UCDOccursIn 

• UnitOccursIn 









Role Indexes (5) 




• coldesc 

• table 

• ucd 

• ucddesc 

• unit 






Resource Indexes (1) 




• exttable 





Figure 1. This is the top view of the UCD Topic Map. The main topics, associations, roles played by topics in 
associations are seen. The main topics are like common nouns and they denote various topic types that other topics 
have. Relationship Indexes are the different associations that we have predefined. A user who is more interested in 
some other associationship can easily get that added through the merging mechanism described in Sec. 5. The Role 
Indexes denote the roles played by the main topics in the associations. In our Topic Map the main topics do not 
play multiple roles and hence the Subject Indexes and Role Indexes lists are identical. Finally, we have illustrated 
the use of a single external resource viz. exttable i.e. a hyperlink to the actual table. As elaborated in the text, this 
will actually be a long list of external links pointing to varied VO resources including table merging, astrostatistics 
etc. 

• UCD has description Column Description, 

• Column (of Table) has units Unit, 

• Column occurs in Table, 

• UCD is parent o/UCD. 

Similarly, the occurrences in the UCD Topic Map can be grouped into a small number of sets: 

• UCDs occur in Tables, 

• Columns occur in Tables, 

• Units occur in Tables, 

• Tables occur as external links in many places. 

If one is dealing with a single table, one encounters several columns, each column with its corresponding UCD 
and a few of the columns having units. There are also several tens of associations and occurrences. Together they 
form a structure which is graspable without any sophisticated arrangements. The HST related view in Fig. 4 is 
an example. It is when one goes to tens of catalogs that the complexity increases rapidly and a more advanced 



Topic Map Overview 



• Ontology 

• Master Index 

• Themes 

• Class Hierarchy 
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* ucd 
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• AT 

• AT_ELEMENT 

• AT_WAVELENGTH 

• CLASS 

• class_code 
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• class_star/galaxy 

• CODE 

• C0DE_ERROR 
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• FIT_TYPE 



Figure 2. Topic Map view centered on the topic UCD. A partial list of UCDs is visible. Each is a clickable 
hyperlink pointing to a separate page with occurrences of that UCD. Selecting different UCDs will allow the user to 
explore different regions of the data resource. 



PHOT FLUX IR 12 



• PHOT FLUX IR I? 



Subject indicators (1) 



► http:/A^ww. astro .caltech.edu/"aam/Ecience/topicmap^UCDTopicMap.ht.rnl#TOPICPHOT_FLUX_ 



Related subjects (17) 



• associated column 

o C 
o F12 

o F12umEst 
o FLUX 12 
o Fnu_12 

• child UCD 

o PHOT_FLUX_IR 

* gen. description 

o Flux density 
(IRAS) at 12 
microns, or 
around 12 
microns (ISO at 
14.3) 

* Table 

o Catalogue of 
Seyfert Galaxies 
(Lipovetsky+, 
1988) 

Far- infrared 
fluxes 

(Vll/l73/table3a) 
o IRAS catalogue 
of Point Sources, 
Version 2.0 
(IPAC 1988) 
IRAS Point 
Sources 
(ll/125/main) 
o IRAS Faint 
Source Catalog, 



1 External resources (1) 


* exttable 






o http:/A'izier. 
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Figure 3. A separate page for the UCD PHOT_FLUX_IR_12 is depicted here. With it are seen the different tables 
it occurs in and the external links for those tables. Under "associated column" one can see the different names that 
different astronomers have used to denote the flux density at and around 12 microns. 



HST Archived Exposures Catalog (STScI, 2002) Merged log of HST Observations (B/hst) 
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• HST Archived Exposures Catalog (STScI, 
2002) Merged log of HST Observations 
(B/hst) 



Subject indicators (1) 
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Figure 4. Typical page centered on a Table. The table "HST archived exposures catalog" is seen here. Also seen 
are the different columns in the table, associated UCDs, and units when available. 

knowledge organization becomes necessary. In the UCD Topic Map we present here0, we have used the 100 most 
frequently accessed catalogs at Vizier. A partial list of the catalogs is shown in Fig. 5, and the resulting number 
of topics, associations etc. are partially sketched in Fig. 6. It is with these large numbers that we can start asking 
interesting questions. This is where the data/knowledge discovery begins. 

In case of a single table one has the table as the main topic with different columns present in the table forming 
other topics, as also the UCDs associated with these columns and the Units associated with some of the columns. 
A schematic of the scenario is presented in Fig. 7. When we move to a mult it able UCD Topic Map, and select 
any UCD as the main topic, we can see all the associated tables (in which the UCD is present). Depending on 
one's interest, one can then start asking specific questions. One such scenario is summarized in Fig. 8. Consider 
an astronomer interested in the infrared part of the electromagnetic spectrum, in particular the 12/im flux density. 
She asks for tables (within the 100 most accessed tables in the UCD TM) that have a column which corresponds to 
the UCD PHOT_FLUX_IR_12. She is returned a list of such catalogs: Catalog of Seyfert Galaxies, IRAS catalog of 
Point Sources, the extended 12/zm galaxy sample and so on. Each of these takes her to that table allowing her to 
explore different columns present in those tables (e.g. which area has been covered, how faint they go, which epoch it 
is, what the resolution is and so on). In addition, the Topic Map author can choose to add different scopes to each of 
the table to, for instance, convey if the catalog is galactic or extra-galactic. Such information could then be used by 
the user to determine if the catalogs contain columns on which a meaningful join can be performed. A very VO like 
example would be: "choose Elliptical galaxies that are strong in radio, fairly bright in infra-red but do not contain 
an AGN" . By navigating through the appropriate keywords for IR, radio fluxes and catalogs of elliptical galaxies 
and catalogs of non-agn galaxies such a search could be achieved. Additionally the Galactic catalogs need not all be 
thrown away. They can be used to check if there exists a population of objects that can act as a contaminant to the 
population that the user has chosen. 

Thus data discovery is all about asking the right questions: Can I merge X-Ray and Radio catalogs? Which 
ones? Do their units match? What are the parameter ranges? What is the basic statistics for them? How does 
the histogram for a column look and so on. Some of the possibilities mentioned above can also be carried out using 

^http: //www. astro. caltech.edu/'--aam/science/topicmaps/ucd. html 
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Subject indicators (1) 



* http://www. astro. caltech.edu/- aam/science/t'opicniap£/LlCDTopicMap.litrril#TOPICtable 



PL 1 , II 



• 3rd Bibliog. Cat. of Stellar Radial Vel. (Barbier-Bressat+, 1994) Third bibliographic 
catalogue of stellar radial velocities (111/184) 

* AGIO Catalogue (Dieckvoss, Heckmann 1975) Catalog Data (1/51 B) 

* Astrographic Catalog Reference Stars (ACRS) (Corbin+ 1991) Part 1 : more reliable 
stars (l/171/acrs1) 

• Astra graphic Catalog Reference Stars (ACRS) (Corbin+ 1991) Part 2: stars with poor 
observational history (IM7l/acrs2) 

♦ Astrographic Catalogue, Zones -02 to +31 degrees (Roeser 1990) Algiers zone 
(-02,-01, +00,+01,+02,+03, +04) (1/154) 

• Bidelman- Parsons Spectrosco pic/Bibliographic Cat (Parsons+ 1980) First data record 
i - >- ■ i- . 

* Bidelman- Parsons Spectroscopi.c/Bibliographic Cat (Parsons+ 1980) Second and 
subsequent data for the object (VI/32/data2) 

• Bonner Durchmusterung (Argelander 1859-62) Changes since the original printing 
(l/122/bdchg) 

• Bonner Durchmusterung (Argelander 1859-62) The Bonner Durchmusterung catalog 
(l/122/bd) 

* Bright Star Catalogue, 5th Revised Ed. (H offleit+, 1991) Remarks (W50/notes) 

* Bright Star Catalogue, 5th Revised Ed. (H offleit+, 1991) The main part of the Catalogue 
(V/50/catalog) 

• Cape Photographic Durchmusterung (Gill+ 1895-1 900) Changes from the printed cat. 
(l/108/cpchg) 

• Cape Photographic Durchmusterung (Gill+ 1895-1 900) The CPD catalog (l/108/cpd) 

* Carlsberg Meridian Catalogs (CMC, 1999) Observation of FK5 stars, lower culmination 

• Carlsberg Meridian Catalogs (CMC, 1 999) Observation of FK5 stars, upper culmination 
(l/255/fk5u) 

« . 

supernovae (l/256/novamag) 

* Carlsberg Meridian Catalogs (CMC, 1 999) Observed magnitudes of suspected 

variables (l/256/suspect) 

■ '(->■■; 



Figure 5. The (partial) list of tables that constitute the current UCD Topic Map. The individual tables can be 
separately browsed for columns, UCDs, units and external links. 
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Figure 6. A summary of topics, associations, occurrences etc. for the UCD Topic Map made out of 100 most 
frequently accessed catalogs at Vizier. Thousands of topics, associations and occurrences are a clear indication that 
a good knowledge discovery tool is needed for exploring such a rich data resource. 




Figure 7. A single table has just a few columns, UCDs and even fewer units and is easily navigable. 




Extra Galactic 



Figure 8. When multiple tables are put together, the complexity increases manifold. With Topic Maps 
complexity can be harvested to ask interesting questions and get meaningful answers. 



tools available elsewhere (e.g. at CDS). However, segregation of different entities by the scoping mechanism affords 
Topic Maps a stronger semantics allowing a quicker zooming in on the required data. Another aspect is the ability 
to merge different Topic Maps. We discuss it in the next section. 

5. TOPIC MAP MERGING 

Topic Map merging is not the same as merging of columns from different tables to create a database join. Topic Map 
merging involves identifying identical Topic Map topics in different Topic Maps and merging their associations and 
occurrences to give rise to a single merged topic. For doing this, a Subjectldentity tag is used. As a result, one can 
have two topics with the same name if they have different Subjectldentity. This is in fact desirable since we often 
use the same name in different contexts for different entities e.g. satellite could be a general term for the satellite of 
a planet, or it could be a man-made communications satellite. 

Here we are more interested in topics that can be merged. Continuing to discuss our hypothetical astronomer, 
now she is equipped with her own dataset of X-Ray observations which she has put in the standard XML format after 
having identified proper UCDs for the different column names. She would now like to merge the database with the 
readymade UCD Topic Map (of the f 00 most used catalogs) in order to later look for the non-AGN radio ellipticals 
which are also detected in X-Ray. The Topic Map generator that we have built allows precisely this to be done, ft 
takes as input a list of tables and then generates a Topic Map out of it in exactly the same format as the UCD Topic 
Map (the new Topic Map is also a UCD Topic Map, though custom built). The topic map then appears as a new 
entry in the list of Topic Maps available. The merge button available at the top prompts the user to choose which 
Topic Maps are to be merged (see Fig. 9). Once the user chooses the original UCD Topic Map and the new custom 
built TM, the merging is done and a new merged topic map is available for exploring. The cgi-bin form through 
which the topic map generation is carried out is shown in Fig. 10. A flowchart of the merging procedure is shown 
in Fig. 11. The TM generator can also make a Topic Map from just the list of catalog names in the format Vizier 
uses. It downloads the catalogs and any sub-catalogs (currently only from Vizier) and forms the Topic Map using 
the template built into the program. 

6. QUERYING AND INDEXING 

Lastly we talk about two aspects that make knowledge discovery with Topic Maps even more viable. These are 
querying and indexing. We mentioned earlier that Topic Maps are XML structures working on other XML structures 
and that querying XML is not straightforward in the usual way. The ISO community has not formalized a TMQL 
(Topic Map Query Language) yet. However, many precursors like eTMQlH AsTMeEj and Tolo£3 

exist. We found 

Tolog easy to use and it comes bundled with Ontopia's free Omnigator. The syntax used is a combination of an SQL 
like language and the powerful prolog. The use of prolog's power allows one to make extensive searches. Examples 
of some queries are given below. 

• query: List all UCDs 
code: instance-of($A,ucd)? 

meaning: give me all instances that match $A=ucd ($A is a variable). 

• query: Count number of child UCDs 

code: Select $A, count($B) from isParentUCD($A : ucd, $B : ucd)? 

meaning: Count all $B such that $A and $B are both UCDs and $B is the child of $A. 

• query: Order the above in descending order, sorted on count. 

code: Select $A, count($B) from isParentUCD($A: ucd, SB: ucd) order by $B desc? 

meaning: Select $A such that $A and $B are UCDs, count the number of $Bs and put it in inverse numeric 
order. 



"empolis: 
"Bond Un 
^Ontopia: 


http://k42.empolis.co.uk/tmql.html 




iversity, Australia: http://topicmaps.bond.edu.au/astma 


http:/ /www. ontopia.net 





Merge in a topic map 


From the list on the right you 
can select another topic map to 
merge with the current topic 
map (UCD.Jttm). You can then 
browse the topic map resulting 
from the merge of these two. 


ZMASS.xtrn 

HST.xtrn 

nedl .xtrn 

newx.xtrn 

ngcZOOO.xtrn 

xray.xtrn 

XRAY_srnall.xtrn 







Figure 9. The form that allows users to merge different Topic Maps. Associations and occurrences of Topics which 
have identical Subjectlndicators are merged. 

The queries can be executed on individual Topic Maps or on merged Topic Maps. Unlike an SQL query run on 
a database, all the returned entries are hyperlinks too, allowing the user to jump to any of the results and continue 
the data discovery trails. 

Indexing is the other important aspect. Topic Maps began life with indexing in mind. It is not surprising that it 
continues to be one of the strengths of this tool. One can create different indexes per Topic Map so that searching is 
faster. As for queries, searches return hyperlinked documents and also the type of match it found and the confidence 
level of the match. The hyperlinked searchable multiple indexes make a Topic Map a combination of indexes, thesauri 
and glossaries. 

7. DISCUSSION AND CONCLUSIONS 

We have outlined above the construction and use of a UCD Topic Map and how users can create their own UCD 
Topic Maps and merge them with other UCD Topic Maps. While this can already act as a tool for custom viewing 
of data and as a knowledge discovery tool, much needs to be accomplished. We will keep adding bells and whistles 
to the Topic Map pagfl 

A lot of stress has been placed on the external links. This is because Topic Maps need not reinvent wheels for 
actions that other services are capable of e.g. Topic Maps facilitate selection of columns for merging. One could then 
easily use the merging VO service provided by Roy William^] to do the actual merging. Similarly, for statistics of 
columns, one could use the prototype astrostatistic^l service. In passing, it must be stressed that Topic Maps are a 
general knowledge organization tool and many varied Topic Maps can be constructed with the VO in mind. Some 
example sketches regarding literature searches and observing logs were presented in Mahabal et al.(2001). 

We encourage you to explore Topic Maps, generate some of your own, UCD or otherwise. Your comments are 
most welcome. 

t http: / /www. astro. caltech.edu/'^aam / science / topicmaps 
* http: //virtualsky.org/conesearch-xmatch.htm 
^http://avyakta.caltech.edu:8080/cgi-bin/astrostat3.cgi 



VO page for generating Topic Maps 



Input filename with list of catalogs ( format) 



Name for the map (no spaces): [ 

[Optional] filename with description for the map [" Browse... | 



Example: use this file as input to make an X-ray topic map 
Generate the TM 



Last Outdated: 14 Aug 2002 



Figure 10. The cgi-bin form that allows users to generate their own UCD Topic Maps from confirming tables 
(XML format). Such a Topic Map can then be merged with other existing UCD Topic Maps. 
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Figure 11. A flowchart showing the path from individual datasets to data discovery through the medium of Topic 
Maps and their merging. 
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